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WE CLAIM: 

1 . A method for identifying a drug discovery target which comprises: 

(a) providing a means for storing and accessing genomics information in a 
database wherein said means permits computational analysis of biological relationships 
among the stored concepts; 

(b) querying the database to identify a disease-related pathway; and 

(c) identifying the biological interactions and actor concepts in the disease- 
related pathway whereby each of the actor concepts involved in each such reaction is a 
drug discovery target. 

2. The method of claim 1 wherein the genomics information comprises information 
relating to genes, their DNA sequences, mRNA, the proteins that result when the genes 
are expressed, and the biological effects of the expressed proteins. 

3. The method of claim 2 wherein the data comprise data extracted from multiple 
public sources. 

4. The method of claim 2 wherein the data comprises proprietary data, 

5. The method of claim 2 wherein the data comprises data extracted from a 
combination of proprietary and public data sources. 

6. The method of claim 2 wherein the data comprises data extracted from a 
combination of proprietary and public data sources. 

7. The method of claim 2 wherein the means for storing the genomics information 
includes an ontology in which: 

(a) each gene, gene product, and biological effect is given an identifier which 
is related to synonyms for the identifier; 

(b) each gene, gene product, and biological effect is categorized by class; and 
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(c) the relationship of each gene, gene product and disease state is defined by 
slots and facets. 

8. The method of claim 2 wherein the candidate drug discovery targets in the disease 
related pathway are prioritized based on factors that include function and complexity. 

9. The method of claun 8 wherein the candidate drug discovery targets are further 
prioritized based on markers for side effects and patient responsiveness. 

10. The method of claim 2, further comprising combining the results of querying the 
database with the results of additional data obtained from one or more additional methods 
for identifying candidate drug discovery targets. 

1 1 . The method of claim 10 wherein the additional data is obtained from one of 
protein-protein interaction studies and protein profiling from mass spectrometry. 

1 2. The method of claim 1 1 wherein the additional data is obtained from differential 
gene expression studies. 

13. The method of claim 1 wherein the genomics information comprises information 
relating to genotype and the disease-related pathway comprises a gene or gene product 
associated with a particular genotype. 

14. The method of claim 1 wherein the genomics information comprises the name of 
each gene, gene product, and their biological effects, and the means for storing and 
accessing the genomics information identifies relationships that are at least one step 
removed. 

1 5. The method of claim 1 , wherein the identifying the biological interactions step 
includes the step of comparing genomics data from the database to user-defined data 
using a statistical model. 
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1 6. The method of claim 1 5, wherein the comparing step includes identifying an 
overlap between user-defined data and data from the database and the statistical model is 
a statistical significance model measuring the likelihood that the overlap is a random 
event. 

17. The method of claim 16, wherein the user-defmed data is one of gene expression 
data and a manually entered gene list. 

1 8. The method of claim 1 , wherein the identifying the biological interactions step 
includes classifying one or more relevant findings using an ontology. 

1 9. The method of claim 1 8, wherein the classifying one or more findings using an 
ontology includes determining a likelihood that the one or more findings residing in a 
particular biological classification in the ontology is statistically significant. 

20. The method of claim 1, wherein the querying the database step includes the step 
of generating profiles according to one or more criteria. 

21 . The method of claim 20, wherein the profiles are pre-generated from the database. 

22. The method of claim 20, wherein the profiles are generated by one of a data- 
driven and model-driven approach. 

23. The method of claim 20, wherein the profiles are gene-centric being derived about 
a central gene for all genes in the database. 

24. The method of claim 20, wherein the profiles are at least one of process, function 
and disease centric being derived about a central biological process for all processes in 
the database. 
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25. The method of claim 20, wherein the profiles are at least one of tissue, organ or 
structure centric being derived about a central physical object for all objects in the 
database. 

26. A method for evaluating user-supplied genomics data using a stmctured database 
that permits the computation of complex relationships among genes and/or gene products 
contained in the database, comprising: 

(a) defining a profile model based on one or more profile definition criterion; 

(b) building a collection of profiles according to the profile model; 

(c) identifying one or more profiles that overlap at least a portion of the user-supplied 
genomics data and determining, for each such overlapped profile, whether the overlap is 
statistically significant; and 

(d) analyzing one or more statistically significant profiles together with the user- 
supplied genomics data including inspecting database-asserted biological interactions 
embodied in the one or more statistically significant profiles. 

27. The method of claim 26, fiirther including the step of pre-generating a profile 
library containing a profile for each one of a genomic information type in the database 
according to the profile model. 

28- The method of claim 27, wherein the profiles are pre-generated from a graph 
structure. 

29. The method of claim 26, further comprising the step of generating profiles by 
querying the database for information matching the one or more profile definition 
criterion. 

30. The method of claim 28, wherein the determining statistical significance step 
includes the step of computing a probability of overlap as a function of information 
contained in the database. 
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3 1 . The method of claim 26, wherein the genomic infomiation type is one of a gene, 
gene product and biological process. 

32. The method of claim 26, wherein the user-supplied genomics data is differential 
gene expression data and the analyzing step further includes one of the steps of: 

(1) identifying a new use for a known therapy wherein the gene expression data 
relates to a pathway affected by the known therapy; 

(2) prioritizing candidate development compounds for further development wherein 
the gene expression data relates to the target of one or more candidate development 
compounds and the analyzing step includes giving higher priority to development 
compounds on the basis of whether or not they are likely to result in an undesirable effect 
based on their involvement in other biological pathways as embodied in the profile; and 

(3) identifying disease-related pathways wherein the disease is a side effect of drug 
therapy, wherein the gene expression data relates to the target affected by the drug 
therapy and the alternative pathways that are also affected by the drug or the drug 
discovery target and that result in an undesirable phenotype are embodied in the profile. 

33. The method of claim 26, wherein the genomics data is differential gene expression 
data relating to particular disease, and wherein the analyzing step further includes the step 
of validating whether the gene expression data are genotypic markers for the disease state 
according to whether a database-asserted biological association related to the disease 
state, which is shared among a plurality of overlapped profiles, is statistically significant. 

34. The method of claim 26, wherein the profile generation criterion include one or 
more of a biological process, number of genes, organismal, gene connectivity, edge 
connectivity, findings source type, experiment context, and tissue consistency criterion. 

35. The method of claim 26, wherein the profiles are generated from a seed node and 
the inspecting database-asserted biological interactions step focuses on the biological 
interactions emanating from the seed node. 
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36. The method of claim 35, wherein the seed is one of a gene, gene product and 
biological process genomic data type. 

37. The method of claim 26, further comprising computing a statistical significance 
for a biological association in the one or more statistically significant profiles. 

38. The method of claim 26, wherein the generating a profile library step includes, for 
each profile generated, the step of selecting a node for a profile based on the number of 
similar findings in the database that link the node to a neighboring node. 

39. The method of claim 26, further comprising the step of displaying information 
related to the one or more statistically significant profiles and genomics data using a GUI. 

40. The method of claim 26, fiirther including the step of annotating the profiles with 
biological associations asserted by the database including one or more of a cellular 
process, molecular process, organismal process and disease process. 

41 . The method of claim 40, further including the step of displaying biological 
association using one of a GUI and a report. 

42. The method of claim 40, wherein the annotation of profiles includes using 
classification information found in an ontology. 

43. The method of claim 26, wherein the determining of statistical significance test 
includes testing a null hypothesis over a discrete probability distribution, the distribution 
being a function of the database size, profile sizes, the user-supplied genomics data size 
and expression values. 

44. The method of claim 26, wherein the generating step includes generating a 
plurality of profile libraries, each of which corresponding to a different one of a plurality 
of profile generation criterions. 
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45. A method for identifying a new use for a known therapy comprising the steps of 
providing a means for storing and accessing genomics information in a database wherein 
said means permits computational analysis of biological relationships among the stored 
concepts; querying the database to identify a disease-related pathway comprising a 
known therapy target; selecting at least one of such disease-related pathways wherein the 
known therapy target is also comprised within a second disease-related pathway; and 
identifying treatment of the second disease as a new use for the known therapy. 

46. A method for prioritizing candidate development compounds for further 
development that comprises the steps of providing a means for storing and accessing 
genomics information in a database wherein said means permits computational analysis 
of biological relationships among the stored concepts; querying the database to identify 
all pathways associated with the target of each candidate development compound and 
assigning higher priority to development compounds on the basis of whether or not they 
are likely to result in an undesirable effect based on their involvement in other biological 
pathways. 

47. A method for identifying disease-related pathways wherein the disease is a side 
effect of drug therapy that comprises the steps of providing a means for storing and 
accessing genomics information in a database wherein said means permits computational 
analysis of biological relationships among the stored concepts; identifying the disease- 
related pathway affected by a drug or drug discovery target; querying the database to 
identify alternative pathways that are also affected by the drug or the drug discovery 
target and that result in the undesirable phenotype. 

48. A method for identifying or validating a genotypic marker for a disease state that 
comprises providing a means for storing and accessing genomics information in a 
database wherein said means permits computational analysis of biological relationships 
among the stored concepts; querying the database to identify a genotypic marker that is 
associated with a disease state. 
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49, A method for identifying a drug discovery target which comprises querying a 
computer system to identify a disease-related pathway and identifying the biological 
interactions and actor concepts in the disease-related pathway whereby at least one of the 
actor concepts involved in each such reaction is a drug discovery target and wherein the 
computer system comprises a means for storing and accessing genomics information in a 
database and a means for computational analysis of biological relationships among the 
stored concepts. 

50. A method of conducting business that comprises receiving compensation from a 
customer in return for identifying to the customer a drug discovery target discovered by 
querying a computer system to identify a disease-related pathway and identifying the 
biological interactions and actor concepts in the disease-related pathway whereby each of 
the actor concepts involved in each such reaction is a drag discovery target and wherein 
the computer system comprises a means for storing and accessing genomics information 
in a database and a means for computational analysis of biological relationships among 
the stored concepts. 

51 . A drag discovery target identified by the process of claim 1 . 

52. A method of drag discovery that comprises: 

(a) identifying a drag discovery target discovered by querying a computer system to 
identify a disease-related pathway and identifying the biological interactions and actor 
concepts in the disease-related pathway whereby one or more of the actor concepts 
involved in each such reaction is a drag discovery target and wherein the computer 
system comprises a means for storing and accessing genomics information in a database 
and a means for computational analysis of biological relationships among the stored 
concepts; and 

(b) screening compotmds against a drag discovery target to identify drag candidates. 

53. A drag candidate identified by the process of claim 52. 
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